Geometric margin domain description with instance-specific margins
نویسنده
چکیده
Support vector domain description (SVDD) is a useful tool in data mining, used for analysing the within-class distribution of multi-class data and to ascertain membership of a class with known training distribution. An important property of the method is its inner-product based formulation, resulting in its applicability to reproductive kernel Hilbert spaces using the “kernel trick”. This practice relies on full knowledge of feature values in the training set, requiring data exhibiting incompleteness to be pre-processed via imputation, sometimes adding unnecessary or incorrect data into the classifier. Based on an existing study of support vector machine (SVM) classification with structurally missing data, we present a method of domain description of incomplete data without imputation, and generalise to some times of kernel space. We review statistical techniques of dealing with missing data, and explore the properties and limitations of the SVM procedure. We present two methods to achieve this aim: the first provides an input space solution, and the second uses a given imputation of a dataset to calculate an improved solution. We apply our methods first to synthetic and commonly-used datasets, then to non-destructive assay (NDA) data provided by a third party. We compare our classification machines to the use of a standard SVDD boundary, and highlight where performance improves upon the use of imputation.
منابع مشابه
A Unified View of Multi-Label Performance Measures
Multi-label classification deals with the problem where each instance is associated with multiple class labels. Because evaluation in multi-label classification is more complicated than singlelabel setting, a number of performance measures have been proposed. It is noticed that an algorithm usually performs differently on different measures. Therefore, it is important to understand which algori...
متن کاملBiological impact of geometric uncertainties: what margin is needed for intra-hepatic tumors?
BACKGROUND To evaluate and compare the biological impact on different proposed margin recipes for the same geometric uncertainties for intra-hepatic tumors with different tumor cell types or clinical stages. METHOD Three different margin recipes based on tumor motion were applied to sixteen IMRT plans with a total of twenty two intra-hepatic tumors. One recipe used the full amplitude of motio...
متن کاملمقایسه آزمایشگاهی اثر Rebonding بر ریزنشت ترمیمهای کلاس V کامپوزیت با استفاده از دو رزین با ویسکوزیته پایین
Background and Aims: Although composite resin restorations have many advantages, they can lead to several clinical problems. The primary reason for these problems is microleakage. The aim of this study was to compare the rebonding effect on microleakage of class V composite restorations using two low viscosity resins.Materials and Methods: In this in vitro study, 60 class V composite restoratio...
متن کاملDimension and Margin Bounds for Reflection-invariant Kernels
A kernel over the Boolean domain is said to be reflection-invariant, if its value does not change when we flip the same bit in both arguments. (Many popular kernels have this property.) We study the geometric margins that can be achieved when we represent a specific Boolean function f by a classifier that employs a reflectioninvariant kernel. It turns out ‖f̂‖∞ is an upper bound on the average m...
متن کاملDetermination of Gain and Phase Margins in Lur’e Nonlinear Systems using Extended Circle Criterion
Nonlinearity is one of the main behaviors of systems in the real world. Therefore, it seems necessary to introduce a method to determine the stability margin of these systems. Although the gain and phase margins are established criteria for the analysis of linear systems, finding a specific way to determine the true value of these margins in nonlinear systems in general is an ongoing research i...
متن کامل